5th international conference
Leveraging Large Language Model for Intelligent Log Processing and Autonomous Debugging in Cloud AI Platforms
With the increasing complexity and rapid expansion of the scale of AI systems in cloud platforms, the log data generated during system operation is massive, unstructured, and semantically ambiguous, which brings great challenges to fault location and system self-repair. In order to solve this problem, this paper proposes an intelligent log processing and automatic debugging framework based on Large Language Model (LLM), named Intelligent Debugger (LLM-ID). This method is extended on the basis of the existing pre-trained Transformer model, and integrates a multi-stage semantic inference mechanism to realize the context understanding of system logs and the automatic reconstruction of fault chains. Firstly, the system log is dynamically structured, and the unsupervised clustering and embedding mechanism is used to extract the event template and semantic schema. Subsequently, the fine-tuned LLM combined with the multi-round attention mechanism to perform contextual reasoning on the log sequence to generate potential fault assumptions and root cause paths. Furthermore, this paper introduces a reinforcement learning-based policy-guided recovery planner, which is driven by the remediation strategy generated by LLM to support dynamic decision-making and adaptive debugging in the cloud environment. Compared with the existing rule engine or traditional log analysis system, the proposed model has stronger semantic understanding ability, continuous learning ability and heterogeneous environment adaptability. Experiments on the cloud platform log dataset show that LLM-ID improves the fault location accuracy by 16.2%, which is significantly better than the current mainstream methods
- North America > United States > Illinois > Champaign County > Champaign (0.04)
- Asia > Thailand > Bangkok > Bangkok (0.04)
- Asia > Middle East > Jordan (0.04)
- Information Technology > Services (1.00)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine (1.00)
Mixed-Precision Graph Neural Quantization for Low Bit Large Language Models
Liu, Wanlong, Xiao, Yichen, Zeng, Dingyi, Zhao, Hongyang, Chen, Wenyu, Zhang, Malu
Post-Training Quantization (PTQ) is pivotal for deploying large language models (LLMs) within resource-limited settings by significantly reducing resource demands. However, existing PTQ strategies underperform at low bit levels < 3 bits due to the significant difference between the quantized and original weights. To enhance the quantization performance at low bit widths, we introduce a Mixed-precision Graph Neural PTQ (MG-PTQ) approach, employing a graph neural network (GNN) module to capture dependencies among weights and adaptively assign quantization bit-widths. Through the information propagation of the GNN module, our method more effectively captures dependencies among target weights, leading to a more accurate assessment of weight importance and optimized allocation of quantization strategies. Extensive experiments on the WikiText2 and C4 datasets demonstrate that our MG-PTQ method outperforms previous state-of-the-art PTQ method GPTQ, setting new benchmarks for quantization performance under low-bit conditions.
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future Directions
Chen, Jiayu, Ganguly, Bhargav, Xu, Yang, Mei, Yongsheng, Lan, Tian, Aggarwal, Vaneet
Deep generative models (DGMs) have demonstrated great success across various domains, particularly in generating texts, images, and videos using models trained from offline data. Similarly, data-driven decision-making and robotic control also necessitate learning a generator function from the offline data to serve as the strategy or policy. In this case, applying deep generative models in offline policy learning exhibits great potential, and numerous studies have explored in this direction. However, this field still lacks a comprehensive review and so developments of different branches are relatively independent. In this paper, we provide the first systematic review on the applications of deep generative models for offline policy learning. In particular, we cover five mainstream deep generative models, including Variational Auto-Encoders, Generative Adversarial Networks, Normalizing Flows, Transformers, and Diffusion Models, and their applications in both offline reinforcement learning (offline RL) and imitation learning (IL). Offline RL and IL are two main branches of offline policy learning and are widely-adopted techniques for sequential decision-making. Notably, for each type of DGM-based offline policy learning, we distill its fundamental scheme, categorize related works based on the usage of the DGM, and sort out the development process of algorithms in that field. Subsequent to the main content, we provide in-depth discussions on deep generative models and offline policy learning as a summary, based on which we present our perspectives on future research directions. This work offers a hands-on reference for the research progress in deep generative models for offline policy learning, and aims to inspire improved DGM-based offline RL or IL algorithms. For convenience, we maintain a paper list on https://github.com/LucasCJYSDL/DGMs-for-Offline-Policy-Learning.
- North America > United States > Indiana > Tippecanoe County > West Lafayette (0.04)
- North America > United States > Indiana > Tippecanoe County > Lafayette (0.04)
- North America > United States > Montana (0.04)
- (10 more...)
- Research Report (1.00)
- Overview (1.00)
- Instructional Material > Course Syllabus & Notes (0.92)
- Leisure & Entertainment > Games > Computer Games (1.00)
- Education (1.00)
- Transportation > Ground > Road (0.67)
- (2 more...)
Accuracy Evaluation of a Lightweight Analytic Vehicle Dynamics Model for Maneuver Planning
Ziehn, J. R., Ruf, M., Roschani, M., Beyerer, J.
Models for vehicle dynamics play an important role in maneuver planning for automated driving. They are used to derive trajectories from given control inputs, or to evaluate a given trajectory in terms of constraint violation or optimality criteria such as safety, comfort or ecology. Depending on the computation process, models with different assumptions and levels of detail are used; since maneuver planning usually has strong requirements for computation speed at a potentially high number of trajectory evaluations per planning cycle, most of the applied models aim to reduce complexity by implicitly or explicitly introducing simplifying assumptions. While evaluations show that these assumptions may be sufficiently valid under typical conditions, their effect has yet to be studied conclusively. We propose a model for vehicle dynamics that is convenient for maneuver planning by supporting both an analytic approach of extracting parameters from a given trajectory, and a generative approach of establishing a trajectory from given control inputs. Both applications of the model are evaluated in real-world test drives under dynamic conditions, both on a closed-off test track and on public roads, and effects arising from the simplifying assumptions are analyzed.
- Europe > Germany > Baden-Württemberg > Karlsruhe Region > Karlsruhe (0.05)
- Asia > Singapore > Central Region > Singapore (0.05)
- Asia > South Korea > Seoul > Seoul (0.04)
- Asia > China (0.04)
- Transportation > Ground > Road (1.00)
- Automobiles & Trucks (1.00)
Advanced Feedback Linearization Control for Tiltrotor UAVs: Gait Plan, Controller Design, and Stability Analysis
Three challenges, however, can hinder the application of Feedback Linearization: over-intensive control signals, singular decoupling matrix, and saturation. Activating any of these three issues can challenge the stability proof. To solve these three challenges, first, this research proposed the drone gait plan. The gait plan was initially used to figure out the control problems in quadruped (four-legged) robots; applying this approach, accompanied by Feedback Linearization, the quality of the control signals was enhanced. Then, we proposed the concept of unacceptable attitude curves, which are not allowed for the tiltrotor to travel to. The Two Color Map Theorem was subsequently established to enlarge the supported attitude for the tiltrotor. These theories were employed in the tiltrotor tracking problem with different references. Notable improvements in the control signals were witnessed in the tiltrotor simulator. Finally, we explored the control theory, the stability proof of the novel mobile robot (tilt vehicle) stabilized by Feedback Linearization with saturation. Instead of adopting the tiltrotor model, which is over-complicated, we designed a conceptual mobile robot (tilt-car) to analyze the stability proof. The stability proof (stable in the sense of Lyapunov) was found for a mobile robot (tilt vehicle) controlled by Feedback Linearization with saturation for the first time. The success tracking result with the promising control signals in the tiltrotor simulator demonstrates the advances of our control method. Also, the Lyapunov candidate and the tracking result in the mobile robot (tilt-car) simulator confirm our deductions of the stability proof. These results reveal that these three challenges in Feedback Linearization are solved, to some extents.
- Transportation > Passenger (1.00)
- Transportation > Air (1.00)
- Aerospace & Defense > Aircraft (1.00)
- Energy > Oil & Gas > Upstream (0.67)
Generative Adversarial Training Can Improve Neural Language Models
Movahedi, Sajad, Shakery, Azadeh
While deep learning in the form of recurrent neural networks (RNNs) has caused a significant improvement in neural language modeling, the fact that they are extremely prone to overfitting is still a mainly unresolved issue. In this paper we propose a regularization method based on generative adversarial networks (GANs) and adversarial training (AT), that can prevent overfitting in neural language models. Unlike common adversarial training methods such as the fast gradient sign method (FGSM) that require a second back-propagation through time, and therefore effectively require at least twice the amount of time for regular training, the overhead of our method does not exceed more than 20% of the training of the baselines.
- North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
- North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.07)
- Europe > France (0.06)
- (4 more...)
CSPLib: Twenty Years On
In 1999, we introduced CSPLib, a benchmark library for the constraints community. Our CP-1999 poster paper about CSPLib discussed the advantages and disadvantages of building such a library. Unlike some other domains such as theorem proving, or machine learning, representation was then and remains today a major issue in the success or failure to solve problems. Benchmarks in CSPLib are therefore specified in natural language as this allows users to find good representations for themselves. The community responded positively and CSPLib has become a valuable resource but, as we discuss here, we cannot rest.
- Oceania > Australia (0.05)
- North America > United States > Virginia > Alexandria County > Alexandria (0.05)
- Europe > United Kingdom > Scotland > Fife > St. Andrews (0.05)
- Europe > Germany (0.05)
Game Metrics Without Players: Strategies for Understanding Game Artifacts
Nelson, Mark J. (IT University of Copenhagen)
Game metrics are an approach to understanding games and gameplay by analyzing and visualizing information collected from players in playtests. This paper proposes that another source of metrics is the game itself, and that not all information needs to (or ought to) come from empirical playtests. I discuss seven strategies for extracting information from games, and discuss how the information retrieved in this manner relates to empirical playtest metrics---which it differs from but can often complement.
- Europe > Denmark > Capital Region > Copenhagen (0.04)
- North America > United States > Massachusetts (0.04)